Employing Cognitive Notions in Multilingual Summarization of News Reports

Author

  • Barbara Gawronska
Abstract

The paper presents an approach to automatic text understanding inspired by speech act theory and cognitive semantics, especially by the notion of ‘mental spaces’ (Fauconnier 1985) and by Pustejovsky’s (1991a, 1991b, 1995) notion of ‘qualia’ and his definition of formal and telic hyponymy. This approach is employed in an experimental system for understanding news reports and generating multilingual news summaries. The system aims to analyze English news reports in the domain of world news and to generate summaries in Swedish, Danish, and Polish. The paper focuses on the understanding component and on the possibility of using WordNet as the main lexical knowledge resource for English. An appropriate semantic analysis requires some modifications of the hyper-/hyponymy and holo-/meronymy relations encoded in WordNet. A combination of a cognitive analysis of certain lexical and phrasal categories with qualia-based reformulations of WordNet hierarchies is proposed and tested.

Similar resources

A Platform for Multilingual News Summarization

We have developed a multilingual version of Columbia Newsblaster as a testbed for multilingual multi-document summarization. The system collects, clusters, and summarizes news documents from sources all over the world daily. It crawls news sites in many different countries, written in different languages, extracts the news text from the HTML pages, uses a variety of methods to translate the doc...

Defining and Classifying Space Builders for Information Extraction

The paper addresses the question of Information Extraction aimed at multilingual text generation, or text re-writing. This method provides an alternative to traditional Machine Translation, but is also related to text summarization. Given a source text, a re-writing system selects and structures the textual information in order to generate a “content report”. The present approach is inspired by...

A Multilingual News Summarizer

Huge multilingual news articles are reported and disseminated on the Internet. How to extract the key information and save the reading time is a crucial issue. This paper proposes architecture of multilingual news summarizer, including monolingual and multilingual clustering, similarity measure among meaningful units, and presentation of summarization results. Translation among news stories, id...

ACL 2013 MultiLing Pilot Overview

The 2013 Association for Computational Linguistics MultiLing Pilot posed a task to measure the performance of multilingual, single-document, summarization systems using a dataset derived from many Wikipedias. The objective of the pilot was to assess automatic summarization of multilingual text documents outside the news domain and the potential of using Wikipedia articles for such research. Thi...

Publication date: 2002